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Abstract 

Species invasions can have wide-ranging biological and socio-economic effects 
and are generally unwanted by legislation. Identification of the source popula- 
tion as well as the ecology and genetics of both the invader population and the 
receiving community is of crucial importance. The rapid invasion of a small 
coregonid fish vendace (Coregonus albula) in a major northern European sub- 
arctic watercourse has resulted in a labile ecological situation in the receiving 
community. The ecological impact of the invasion has been thoroughly docu- 
mented, but the genetics of the invasion remains to be explored. We analyzed 
the genetic diversity and divergence patterns among the two possible source 
populations from southern Finnish Lapland and three colonists populations 
within the Inari-Pasvik watercourse using ten microsatellite loci in order to (i) 
identify the most likely source of the invasion, (ii) reveal the dispersal pattern 
and genetic structure of the secondary expansion, and (iii) to investigate 
whether the initial introduction and the secondary expansion were associated 
with founder effects. We revealed that repeated translocation of vendace from 
Lake Sinettajarvi into a tributary lake of L. Inari in 1964-1966 is the most plau- 
sible source for the invasion. Both the initial introduction and the secondary 
expansion were found not to be associated with significant founder effects. The 
secondary expansion followed a stepping stone pattern and the source and colo- 
nist populations of this expansion have undergone rapid genetic divergence 
within a period of 15-35 years (ca. 8-17 generations). The rapid divergence 
may be contributed to lack of gene flow among the source and colonist popula- 
tions due to the extensive hydroelectric damming in the watercourse. Multiple 
introductions and substantial genetic variation in combination with the boom- 
and-bust population development of the species thus likely counteracted the 
founder effects as well as fueled the rapid establishment and expansion of this 
species within the Inari-Pasvik watercourse. 



Introduction 

Introductions and invasions of exotic species represent a 
problem of global extent (Vitousek et al. 1996; William- 
son 1996; Olden and Rooney 2006). Severe ecological 
effects may arise from invasions or possible interactions 
between climate change and invasions (Sandlund et al. 
1999; Mooney and Hobbs 2000; Driscoll et al. 2012), and 
the consequences may also be associated with huge 
economical costs (Moyle 1986; Mack et al. 2000; Pimentel 
et al. 2000). During recent decades, numerous studies 



have addressed the ecological effects of biological inva- 
sions, particularly with respect to the consequences for 
the receiving communities. Negative impacts include 
imperilment of native species (Allan and Flecker 1993; 
Peterson et al. 2004; Bohn et al. 2008), alterations of 
community structure and natural biodiversity (William- 
son 1996; Parker et al. 1999), and a worldwide biotic 
homogenization (Rahel 2000, 2002; Olden and Poff 2004; 
Olden and Rooney 2006). However, species introductions 
may also be seen as large-scale ecological experiments, 
and provide unique insight into ecological interactions as 
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well as population biology and genetics (Sakai et al. 2001; 
Shea and Chesson 2002; Sax et al. 2007; Davis 2009). 

The genetic consequences and outcome of an invasion 
invent for the invaders and the colonist populations, that 
is, the correlation between propagule pressure, genetic 
variability of the invaders and the resulting genetic vari- 
ability in the colonist populations, are not trivial. Colo- 
nist populations, that is, the populations descending from 
introduction events, may rapidly develop differences in 
morphology and phenotypes (Facon et al. 2008; Ward 
et al. 2012) and in ecology (Sandlund 1992; Sakai et al. 
2001; Amundsen et al. 2012) as compared to the source 
population, but the genetic component is often unknown 
(e.g., Blanchet 2012). Colonist populations can give rise 
to secondary expansions, that may cover large geographi- 
cal scales (Facon et al. 2008; Brown and Stepien 2009; 
Reusch et al. 2010). Such demographic expansions propel 
divergent adaptive pressures due to exploration of new 
environments (Ghalambor et al. 2007), which again may 
promote rapid changes in life history of the invader (Dlu- 
gosch and Parker 2008a; Amundsen et al. 2012; Gutowsky 
and Fox 2012). While most studies have found lower 
genetic variation in the colonist compared to the source 
populations (Henry et al. 2009; Ayres et al. 2010; but see 
Dlugosch and Parker 2008b for review), others show no 
loss of genetic variation (Novak and Mack 2005; Stepien 
et al. 2005; Wares et al. 2005). Some studies even find 
increased genetic variation in the colonist populations 
when they originate from multiple introductions promot- 
ing genomic admixture (Golubtsov et al. 1993; Kolbe 
et al. 2004; Lavergne and Molofsky 2007). The fraction of 
genetic variation transferred from the source populations 
to the invaders depends on propagule pressure (Lock- 
wood et al. 2005; Colautti et al. 2006; Simberloff 2009), 
dispersal mode (Hewitt 1996; Ibrahim et al. 1996; Wilson 
et al. 2009), and/or route of invasion (Estoup and Guille- 
maud 2010). The altered selection pressure applied when 
individuals explore new geographical ranges also repre- 
sents challenges for the invader (Lee 2002; Suarez and 
Tsutsui 2008). Thus, it is the combined effect of these 
factors that determine whether the invaders are successful 
in their colonization and proliferation. 

The present study concerns the introduction and subse- 
quent invasion of an exotic fish species (vendace, Coregonus 
albula, Fig. 1) into the subarctic Inari-Pasvik watershed 
(Norway, Finland, and Russia). The first introduction 
occurred in 1956, when some vendace fry were translocated 
for stocking purposes from Lake Kelujarvi in central Fin- 
land to the Inari Hatchery, from where they subsequently 
escaped into Lake Inari (Mutenia and Salonen 1992; see 
also Fig. 2). The second introduction occurred in 1964- 
1966 when vendace fry were translocated from Lake 
Sinettajarvi to a small lake in the catchment area of Lake 




Figure 1. Invading vendace (Coregonus albula) from the Pasvik-lnari 
watercourse, Norway (photo: K. 0. Gjelland). 

Inari. The first vendace were observed in Lake Inari in 
1973. By the early 1980s, a vendace population had estab- 
lished throughout the lake (Mutenia and Salonen 1992), 
increasing to a peak abundance in 1989 (Salonen 1998, 
2004). During this period a downstream invasion of vend- 
ace apparently occurred to lakes in the Pasvik watercourse, 
the outlet river from Lake Inari, where the species was 
observed for the first time in 1989 (Amundsen et al. 1999). 
The vendace were observed in L. Vaggatem for the first 
time in 1991 and in L. Skrukkebukta in 1993, and within 
few years the invader became an important pelagic fish spe- 
cies in lakes in the Pasvik watercourse (Bohn et al. 2004, 
2008). Severe ecological consequences have been docu- 
mented for the receiving system, including changes in 
biodiversity and species composition, food web dynamics 
and ecosystem functioning (e.g., Bohn and Amundsen 
1998, 2001; Amundsen et al. 2003). The vendace have dur- 
ing the invasion entered a typical fluctuating 'boom-and- 
bust' development (Salonen et al. 2007), resulting in a 
destabilized aquatic ecosystem. 

Although documentation of the ecological impact of the 
vendace invasion is accumulating, the genetics of the inva- 
sion remains to be addressed. A recent study revealed 
reproductive isolation between vendace in L. Inari and a 
downstream lake (Amundsen et al. 2012), which initiated 
this more detailed genetic study of the system. Because we 
hold detailed temporal and ecological information about 
the invasion process (see e.g., Mutenia and Salonen 1992; 
Gjelland et al. 2007; Bohn et al. 2008) and have the special 
opportunity to precisely identify the source of the invasion, 
the system offers an excellent model system to assess 
ongoing genetic and evolutionary changes. We also expect 
the downstream invasion to be highly influenced by the 
seven hydropower dams that exist in the watercourse 
(Amundsen et al. 2012). The dams only allow unidirec- 
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Figure 2. Origin and sample locations (a) and 
molecular genetic variation (b, c) of vendace in 
northern Fennoscandia. In (a) light gray arrows 
show the direction of drainage. Also, note that 
hydropower dams (indicated by black lines in 
the map) are located between the introduced 
L. Inari population (In) and the populations 
within the secondary expansion (Vg and Sb) 
limiting downstream gene flow and ruling out 
any upstream gene flow. The schematic 
illustration of the invasion process following a 
stepping stone (b) and normal dispersal pattern 
(c) are illustrated with the corresponding pie 
charts of the relative allelic frequencies of 
three representative microsatellite loci 
(ClaTet13, Cocl-lav49, and ClaTet06). Dashed 
lines indicate man-made translocations 
(L. Sinettajarvi and L. Kelujarvi to L. Inari) and 
full black lines delineate the natural secondary 
expansion for the invasion. Below are the color 
codes for the alleles per locus. Sample 
locations are abbreviated according to Table 1. 



tional downstream gene flow that would be limited or 
absent during normal river flow conditions, as the fish 
larvae or adults would need to pass through dam turbines 
(see also Amundsen et al. 1999, 2012). Hence, potential 
factors that may limit the successful colonization in the sys- 
tem may relate both to founder effects and to unidirec- 
tional gene flow. Here, we used ten microsatellites to 
explore the genetics of the introduction and subsequent 
downstream migration of vendace in the watercourse by 
comparing the two potential source populations, the intro- 
duced population, and two downstream colonist popula- 
tions originating from the secondary expansion. The main 
objectives of the study were to (i) identify the most likely 
source of the invasion, (ii) reveal the dispersal pattern and 



genetic structure of the secondary expansion, and (iii) to 
investigate whether the initial introduction and the second- 
ary expansion were associated with founder effects. 

Materials and Methods 

Study lakes and sample collection 

To identify the most likely origin of the invasion, vendace 
were sampled from the introduced population in L. Inari, 
northern Finland, and the two potential source popula- 
tions, L. Sinettajarvi and L. Kelujarvi, central Finland 
(Table 1; Fig. 2). Vendace were furthermore sampled 
from the colonist populations in L. Vaggatem and 
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Table 1. Locations and codes samples included in the genetic analysis of the northern European vendace invasion (Fig. 2), with latitude and lon- 
gitude (position), sample date, water drainage, lake size and height of location in water course, date of translocation (for L. Kelujarvi and L. 
Sinettajarvi to L. Inari) or first observation on locality (for L. Inari, L. Vaggatem and L. Skrukkebukta), secchi depth, and sample size (W). 

Seech i 

Water course Area Height Year depth 
Lake Code Position Sample date (Ocean)* (km 2 ) (m) (Observed) (Max depth, m) N 



L. Kelujarvi 


Ke 


67°28'N, 27°4'E 


14.11.2008 


Kemijoki (BS) 


9 


187 


1956 


2(10) 


L. Sinettajarvi 


Si 


66°35'N, 25°25'E 


25. 10.2008 


Kemijoki (BS) 


9 


97 


1964-1966 


3(40) 


L. Inari 


In 


69°10'N, 27°55'E 


30.10.2008 


Inari-Pasvik (AO) 


1102 


118 


1973 


5(95) 


L. Vaggatem 


vg 


69°13'N, 29°14'E 


09.09.2008 


Inari-Pasvik (AO) 


34 


52 


1991 


3(30) 


L. Skukkebukta 


Sb 


69°33'N, 30°7'E 


16.09.2008 


Inari-Pasvik (AO) 


7 


21 


1993 


6(38) 



*BS, Baltic Sea; AO, Arctic Ocean. 

L. Skukkebukta in the Norwegian part of the Inari-Pasvik 
watercourse to investigate the genetics of the secondary 
expansion. All samples were collected in the autumn 2008 
near or at the spawning grounds and consisted of >92% 
ripe individuals. There are seven hydropower dam con- 
structions located below L. Inari (Fig. 2). More detailed 
descriptions of L. Inari and the Pasvik watercourse are 
given by Mutenia and Salonen (1992) and Amundsen 
et al. (1999), respectively. 

Microsatellite DNA amplification and 
genotyping 

Genomic DNA was extracted from gill filaments by E-Z96 
Tissue DNA Kit (OMEGA Bio-tek, Norcross, GA) follow- 
ing the manufacturer instructions. The individuals were 
genotyped at 10 microsatellite loci (Table SI) arranged in 
three multiplex polymerase chain reactions (PCR) follow- 
ing a previously described protocol (Praebel et al. in press). 
The PCR products were separated on an ABI 3130 XL 
Automated Genetic Analyzer (Applied Biosystems, Foster 
City, CA) and alleles scored in the GeneMapper 3.7 soft- 
ware (Applied Biosystems). After the first validation of the 
genotypes, 3-4% of the individuals within each population 
were re-extracted and rerun at all ten loci. The genotypes 
resulting from the initial run and the rerun were manually 
compared for all individuals to rule out miss-scoring of 
alleles. If any doubt occurred in this comparison the sam- 
ples were re-extracted and rerun at all loci to obtain a con- 
sensus genotype. The samples were finally screened for 
abnormalities (null alleles, scoring errors, etc.) in the soft- 
ware MICRO-CHECKER 2.2.3 (Van Oosterhout et al. 
2004), using 1000 bootstrap replications to generate the 
expected homozygote and heterozygote allele size differ- 
ence frequencies. 

Statistics 

The within population genetic variation indices; number of 
alleles (N A ), expected (H e ) and observed (H Q ) heterozygos- 



ity, and the fixation index (F IS ) were estimated in GenAlEx 
6.41 (Peakall and Smouse 2006; Table SI). Deviations from 
Hardy-Weinberg equilibrium (HWE) for each population 
and locus and linkage disequilibrium (LD) among loci and 
among loci over all populations were tested by exact tests 
(Guo and Thompson 1992) using GENEPOP 4.0 (Rousset 
2007). The tables of pair-wise P- values from the LD and 
HWE tests were corrected for multiple comparisons by 
sequential Bonferroni corrections (BFCs) following Rice 
(1989). Allelic (Nra) and private allelic richness (N RPA ) per 
population were determined, accounting for differences in 
sample-sizes, using the rarefaction procedure for the small- 
est sample size (100 genes) as implemented in the software 
HP-RARE 1.0 (Kalinowski 2005). 

To infer which of the two initial vendace stockings 
(L. Sinettajarvi or L. Kelujarvi) in L. Inari that led to suc- 
cessful colonization of the Inari-Pasvik watercourse, a 
neighbor-joining tree was build using Nei et al. (1983) 
genetic distance (D a ) and nodes were tested for robust- 
ness by 1000 bootstraps using Populations 1.2.32 (Langel- 
la 2005) and viewed in TREEVIEW (Page 1996). 
A sample of native vendace (N = 20) obtained from 
northern Germany was used to assist the clustering of the 
neighbor-joining tree. Genetic divergence between the 
possible source populations and the introduced L. Inari 
population as well as among the colonist populations 
were estimated by pair-wise F S r (Weir and Cockerham 
1984) values and tested for statistical significance (10,000 
permutations) using ARLEQUIN 3.5.1.2 (Excoffier and 
Lischer 2010). The table of P- values for the pair- wise _F ST 
values was corrected for multiple comparisons by BFCs 
following Rice (1989). Bayesian clustering as implemented 
in STRUCTURE 2.3.2 (Pritchard et al. 2000; Hubisz et al. 
2009), was used to provide another estimate of origin and 
population structure of the data. We used a model 
assuming admixture and correlated allele frequencies 
between K populations (Burn-ins of 100,000 replications 
and 300,000 Markov chain Monte Carlo (MCMC) 
replicates). Sampling locations were used as a priori infor- 
mation to assist the structuring (the LOCPRIOR model) 
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as recommended for weak signals of structuring (Hubisz 
et al. 2009). All runs were replicated 10 times at each 
K = 1-5 to confirm consistency of log-likelihood proba- 
bilities. The most likely (highest In Vr(J^K)) grouping 
was visualized using STRUCTURE HARVESTER (Earl 
and vonHoldt 2012). Structuring of the vendace popula- 
tions were also estimated by principal component analysis 
(PCA). The ordination of the individuals was performed 
in the program GenAlEx 6.41 (Peakall and Smouse 2006). 
Finally, we investigated whether shared private alleles 
could be identified in pair-wise comparisons of the source 
L. Kelujarvi and/or L. Sinettajarvi and the introduced 
population (L. Inari), as well as among populations 
within the watercourse which would support any link 
found in the population structuring approach. The rare- 
faction procedure implemented in ADZE 1.0 (Szpiech 
et al. 2008) were used to estimate the pair-wise shared 
private allelic richness across loci for the possible source 
populations and the introduced population (L. Inari) and 
for the secondary expansion within the water course (i.e., 
between L. Inari-L. Vaggatem, L. Inari-L. Skrukkebukta, 
and L. Vaggatem-L. Skrukkebukta). All estimates were 
performed using a standardized sample size correspond- 
ing to the smallest sample (100 genes). 

We tested whether the initial stocking and the subse- 
quent invasion was associated with founder effects by 
testing for genetic bottlenecks among the populations 
using the software BOTTLENECK 1.2.02 (Cornuet and 
Luikart 1996). Population bottlenecks will cause a tempo- 
rarily imbalance in the mutation-drift equilibrium, where 
additions of new alleles via mutation are balanced by the 
loss of alleles via drift (Luikart and Cornuet 1998). We 
utilized this assumption to identify situations where the 
actual sample heterozygosity exceeds a permuted equilib- 
rium heterozygosity as expected under a mutation- drift 
equilibrium. If the actual heterozygosity exceeds the equi- 
librium heterozygosity it is indicative of a recent popula- 
tion bottleneck (Cornuet and Luikart 1996). We used 
1000 coalescent simulations and assumed a two-phased 
model of mutation (TPM) and the more conservative 
step-wise mutation model (SMM). The statistical signifi- 
cance of the deviations at equilibrium and observed het- 
erozygosities were tested with Wilcoxon signed-rank tests. 
We also tested for genetic signatures of founder effects 
following the invasion and the secondary expansion, by 
estimates of changes in the effective population size (N e ). 
Changes in the effective population size will indicate the 
relative contribution of genetic drift (Hedrick 2000). N e 
was estimated for all populations using OneSamp 1.1 
(Tallmon et al. 2008). This software uses approximate 
Bayesian computation to estimate variance N e from sum- 
mary statistics that are related to N e . We used prior 



upper and lower bounds for N e of 2-1000 and 10,000 
replications to generate the 95% credible intervals. 
Finally, we correlated the pair-wise -Fst's and the differ- 
ence in expected heterozygosity between the source popu- 
lation and Inari-Pasvik populations (AH e ) with 
geographic distance and years from translocation to 
occurrence. This was performed to reveal spatial and tem- 
poral patterns of the mechanism underlying the founder 
event and to estimate the time of the possible expansion 
from L. Inari downstream the Pasvik watercourse. We 
expected increased differentiation (pair-wise -F S t's) and 
difference in genetic diversity (AH e ) the longer the dis- 
tance and time since divergence. The correlation of geno- 
type data and geographical distance (physical divergence) 
from the source population to each of the populations in 
the Inari-Pasvik watercourse were compared with the cor- 
responding correlations for the three possible expansion 
times. The time from introduction to the founding of the 
L. Inari population was in all estimates set to 8 years, 
whereas the expansion time from L. Inari downstream the 
watercourse was tested for initial sighting of vendace 
larvae (1973), total cover of lake (1980), and peak (1989) 
(Table S2). We used partial Mantel tests as implemented 
in GenAlEx 6.41 (Peakall and Smouse 2006), to test these 
scenarios, using 9999 permutations to obtain significance. 

Results 

Genotyping, validation, and quality control 
of genotypic data 

We did not identify any mismatch between the original 
individual multilocus genotypes and the re-extracted 
3-4% replicates within the present dataset. Heterozygote 
deficits were indicated by MICRO-CHECKER at BWF1 
(L. Vaggatem, L. Skrukkebukta), C2-157 (L. Skrukke- 
bukta), Cocl-lav06 (L. Sinettajarvi, L. Inari, L. Vaggatem, 
and L. Skrukkebukta) Cocl-lavlO (L. Sinettajarvi, L. Inari, 
L. Vaggatem, and L. Skrukkebukta), Cocl-lav49 (L. Skruk- 
kebukta), ClaTet06 (L. Kelujarvi), ClaTetl3 (L. Sinettajarvi 
and L. Skrukkebukta) all indicated as caused by the pres- 
ence of null alleles. To test whether the loci with potential 
null alleles may affect the results the STRUCTURE analy- 
ses were performed as described in the Material and Meth- 
ods but without the loci showing the most heterozygote 
deficits across populations (BWF1, Cocl-lav06, and Cocl- 
lavlO). Neither the number of inferred clusters nor the 
population structure was found to differ when comparing 
the results with and without these loci (Figs. SI vs. 4). 
Given the relatively few loci used in the study we therefore 
maintained all loci in the full analysis to ensure statistical 
power. 
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Genetic variation 

The standard indices of within population genetic variation 
are given in Tables 2 and SI. We discerned 124 alleles 
among the ten microsatellite loci assayed in the five studied 
vendace populations, with a within population variation of 
47 to 89 alleles. Mean number of alleles (N A ) per locus per 
population varied from 2 to 25 and expected (H e ) and 
observed heterozygosity (H 0 ) per locus per population 
varied from 0.019 to 0.916 and 0.019 to 0.895, respectively. 
All populations, except L. Kelujarvi, showed significant 
departures from HWE associated with heterozygote defi- 
cits. Linkage disequilibrium was identified for 4 out of 45 
pair-wise locus tests across populations but all returned 
non-significant after sequential BFC. The individual locus 
tests displayed 18 out of 50 and 7 out of 50 significant devi- 
ations from HWE before and after BFCs, respectively 
(Table SI), and 2 to 9 significant LD's within each popula- 
tion (only three significant after BFC, all in L. Sinettajarvi). 

The allelic richness were almost double in L. Sinet-tajarvi, 
L. Inari, L. Vaggatem, and L. Skrukkebukta (Nra = 7.9-8.8) 
compared to L. Kelujarvi (Nra = 4.6) (Table 2). Private alle- 
lic richness varied between -Nrpa = 0.11-0.97 with the lowest 
Nrpa found in the lower Pasvik river lakes. H e and H Q were 
also higher in L. Sinettajarvi, L. Inari, L. Vaggatem, and L. 
Skrukkebukta (H e = 0.596-0.614; H Q = 0.506-0.546) com- 
pared to L. Kelujarvi (H e = 0.400; H 0 = 0.369) (Table 2). 

Identifying the source for the vendace 
invasion and population structure within 
the secondary expansion 

The pair-wise P S t estimates of genetic differentiation 
between the two possible source populations 
(L. Kelujarvi/L. Sinettajarvi) and L. Inari suggest that L. 
Sinettajarvi was the source population as this population 
pair display the lowest, although significant, differentia- 
tion (L. Kelujarvi vs. L. Inari, F sr = 0.158, P < 0.0001; L. 
Sinettajarvi vs. L. Inari, F ST = 0.011, P = 0.0027; Table 
S3). L. Kelujarvi moreover appeared to be the most 



isolated population of the five populations studied 
(F ST = 0.141-0.158; P < 0.0001). Within the Inari-Pasvik 
watercourse, the introduced population (L. Inari) was 
significantly different from the populations from the 
secondary expansion (L. Vaggatem and L. Skrukkebukta), 
whereas the populations from L. Vaggatem and L. Skruk- 
kebukta could not be significantly discriminated (Table 
S3). The sequential BFCs did not change the significance 
level for any of P-values from the pair-wise F ST estimates. 
The PCA plot revealed groupings of (i) L. Kelujarvi and 
(ii) L. Sinettajarvi, L. Inari, L. Vaggatem, and L. Skrukke- 
bukta (Fig. 3). Each axis, PCI and PC2, explained 26.6% 
and 24.4% of the total variation, respectively. 

The Bayesian clustering revealed that the individuals 
could be partitioned into two genetic clusters (K = 2; In Pr 
(X\K) ± SD = -7209 ± 1; Fig. S2), separating L. Kelujarvi 
into one and L. Sinettajarvi, L. Inari, L. Vaggatem, and L. 
Skrukkebukta into the other cluster (Fig. 4). The combined 
approach of a neighbor-joining tree with Nei et al. (1983) 
genetic distance (DJ provided support for L. Sinettajarvi 
being the most likely source for the vendace in the Inari- 
Pasvik watercourse. Moreover, the neighbor-joining tree 
also provided support for divergence between the founder 
population of L. Sinet-tajarvi and the introduced popula- 
tion in L. Inari (bootstrap support of 100%) as well as 
between L. Inari and the populations from the secondary 
expansion (L. Vaggatem and L. Skrukkebukta, 98%), which 
is also supported by the low, but significant f ST values. 
Finally, the Bayesian clustering and the neighbor-joining 
tree grouped L. Vaggatem and L. Skrukkebukta together 
supporting the non- significant F ST value estimated between 
these populations. 

A higher mean number of shared private alleles were 
found within the L. Sinettajarvi-L. Inari pair compared to 
the L. Kelujarvi-L. Inari pair (Fig. 5a). The mean number 
of private alleles shared by the L. Kelujarvi-L. Sinettajarvi 
pair resembled that of the L. Kelujarvi-L. Inari pair, sup- 
porting the conclusion that L. Sinettajarvi is the source 
population for the invasion. For the secondary expansion 
the pair-wise comparison of mean number of shared pri- 



Table 2. Summary genetic statistics of the vendace populations included in the study. 



Sample 




H e 


H 0 


Wa 


W RA 


Wrpa 


TPM 


SMM 


We 


Ke 


0.0772 


0.400 


0.369 


47 


4.6 


0.79 


0.084 


0.003 (d, 0.002) 


55 (38-153) 


Si 


0.1133* 


0.614 


0.544 


89 


8.6 


0.97 


0.322 


0.019 (d, 0.009) 


234 (130-988) 


In 


0.0870* 


0.597 


0.546 


89 


8.8 


0.73 


0.193 


0.007 (d, 0.003) 


250 (150-746) 


vg 


0.0849* 


0.596 


0.546 


81 


7.9 


0.11 


0.625 


0.019 (d, 0.009) 


150 (93^122) 


Sb 


0.1664* 


0.607 


0.506 


79 


8.4 


0.58 


0.275 


0.003 (d, 0.002) 


200 (112-703) 



The coefficient of inbreeding (F ls ), genetic diversity estimates (HJH a ), total number of alleles (W A ), mean allelic (Wra), and private allelic richness 
(Wrpa). The P-values for the Bottleneck test are given (TPM and SMM) and significant values (SMM) were only found significant for heterozygote 
deficit (denoted with d and the corresponding P-value for the exact test). The effective population sizes (We) with credible intervals are given. 
Sample codes as in Table 1. 

*Significant heterozygote deficit (positive F,s) estimated by Hardy-Weinberg exact tests. 



) 2013 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. 



1405 



Invasion Genetics of Vendace 



K. Praebel et al. 



i Ke ■ Si a In ♦ Vg a Sb 



O 

CL 



A 


• • 

* 9 ° ° ° ' . 


-i » «5 V/^ 

■ 

A .* ^ ^ 


JA A 



PC 1 (26.6%) 

Figure 3. Principal component analysis (PCA) plot of the genetic 
structuring among the five vendace populations. PC1 and PC2 explain 
26.6% and 24.4% of the total variation, respectively. Sample 
abbreviations as in Table 1. 



vate alleles show that the L. Inari-L. Vaggatem pair share 
fewer private alleles than the L. Vaggatem-L. Skrukkebukta 
pair, with the L. Inari-L. Skrukkebukta pair sharing the 
lowest number of private alleles (Fig. 5b). This suggests a 
stepping-stone dispersal where L. Inari vendace colonized 
L. Vaggatem and the population in L. Vaggatem subse- 
quently founded the population in L. Skrukkebukta 
(Fig. 2b). Some specific alleles (e.g., allele 270 at ClaTetl3, 
allele 190 at Cocl-lav49, but see Fig. 2b) were also observed 
only in the L. Sinettajarvi, L. Inari, L. Vaggatem, and L. 
Skrukkebukta populations and display an expanding pat- 
tern, while others (e.g., allele 189 at ClaTet06) were present 
in all populations but in much higher/lower frequencies in 
L. Kelujarvi compared to the other four lakes (Fig. 2b). 

Founder effects and patterns of spatial and 
temporal genetic variation 

None of the populations showed significantly higher 
expected heterozygosity (H e ) than equilibrium heterozy- 
gosity (H eq ) in the analysis of bottlenecks assuming a 
TPM. Similar results were revealed when assuming a 
SMM. All estimates of effective population sizes were asso- 
ciated with overlapping CIs (Table 2). However, the esti- 
mate of L. Sinettajarvi and L. Inari did not differ in 
magnitude, supporting that the introduction of vendace 
into the system was associated with limited founder effects 
(drift). A decreasing trend in effective population size 
(although all CIs overlapped) was revealed from L. Inari 
to L. Skrukkebukta suggesting an increased influence of 
drift and restricted gene flow during the expansion. 

All spatial and temporal correlations of genotypes were 
found to be significant (Table 3). The period of peak 
abundance of vendace (1989) in L. Inari also correlate the 
best with patterns of genetic expansion (AH e ), suggesting 
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Figure 4. Genetic structure among L. Kelujarvi (Ke), L. Sinettajarvi 
(Si), L. Inari (In), L. Vaggatem (Vg), and L. Skrukkebukta (Sb) as 
revealed by a neighbor-joining trees using Nei et al. (1983) genetic 
distance (D a ) and Bayesian clustering of all individuals using 
STRUCTURE (Hubisz et al. 2009) assuming two genetic clusters of 
individuals (K = 2). Only bootstrap resampling percentages above 50 
are shown for the neighbor-joining tree. In the STRUCTURE analysis 
black lines separate individuals from different sampling sites (labeled 
right) and each individual is represented by a thin horizontal line, 
which is partitioned into /C-colored segments representing individual's 
estimated membership fractions in K clusters. The absolute mean 
values of In Pr(X|/f) are plotted for K = 1-5 in supplementary Fig. S2. 

that the secondary expansion downstream the watercourse 
occurred late in the invasion process and that it was associ- 
ated with genetic constrains. This is also in accordance with 
the observation of vendace in catches in Pasvik; vendace 
were first observed in the upper Pasvik in 1989, and then 
successively downstream the watercourse (Vaggatem in 
1991 and Skrukkebukta in 1993; Amundsen et al. 1999). 

Discussion 

Primary source of the Inari-Pasvik vendace 
invasion 

There were two possible sources for the introduction and 
subsequent downstream invasion of vendace in the Inari- 
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Sample size (genes) 

Figure 5. The mean number of shared private alleles private to 
location pairs of L. Kelujarvi-L. Inari (Keln), L. Sinettajarvi-L. Inari (Siln), 
and L. Kelujarvi-L. Sinettajarvi (KeSi) (a) and within the Inari-Pasvik 
watercourse (L. Inari-L. Vaggatem, InVg; L. Inari-L. Skrukkebukta, 
InSb; and L. Vaggatem-L. Skrukkebukta, VgSb) (b) as a function of 
standardized sample sizes (in genes). Error bars represent ±SEM 
across the 10 microsatellite loci. 



Table 3. Spatial and temporal correlations of pair-wise differences 
(Fst) and differences in expected heterozygosity (AH e ) using partial 
Mantel tests. 





Fst 




AH e 






P 




P 


Geographical distance 


0.548 


0.041 


-0.987 


0.040 


Time| n itjal_observation 


0.778 


0.045 


-0.623 


0.040 


Timej 0 ta|_ CO verage 


0.682 


0.045 


-0.817 


0.042 


Ti HI ep ea k_ab Linda nee 


0.483 


0.042 


-0.912 


0.041 



The coefficient of regression (R xy ) and the corresponding P-value 
(9999 permutations) are given. Geographical distances and expansion 
times used for correlations are given in Table S2. AH e , Pair-wise differ- 
ence in heterozygosity between L. Sinettajarvi and the populations 
within the Inari-Pasvik watercourse. 



Pasvik watercourse (Mutenia and Salonen 1992; Salonen 
1998). Multilocus microsatellite data and several statistical 
approaches clearly identified L. Sinettajarvi vendace as the 
most likely founder population of Inari-Pasvik vendace. 
The low, but significant, pair-wise F ST value found 
between the vendace populations of L. Sinettajarvi and 
L. Inari indicates that these populations have diverged 
recently. In comparison, the F ST value between 
L. Kelujarvi and L. Inari was high and represents the typi- 
cal level of genetic divergence reported between other 



native postglacial coregonid populations in northern Fen- 
noscandia (0stbye et al. 2006; Saisa et al. 2008; Praebel 
et al. in press). Moreover, support for L. Sinettajarvi 
being the source population for the invasion was found 
in the combined approach of a neighbor-joining tree and 
Bayesian clustering, where genetic homogeneity was 
shown for L. Sinettajarvi and the Inari-Pasvik watercourse 
populations whereas L. Kelujarvi formed its own cluster. 
Bayesian clustering and neighbor-joining trees have suc- 
cessfully been used in other studies for inference of routes 
and sources of biological invasions (see review by Estoup 
and Guillemaud 2010), and supportive results, as seen 
herein, provide strong evidence for the revealed pattern. 
L. Sinettajarvi and L. Inari also shared a higher number 
of private alleles than L. Kelujarvi and L. Inari, which 
provide additional support for L. Sinettajarvi being the 
founders, as time since introduction is too short for many 
new private alleles to accumulate. Addressing the number 
of shared private alleles between pairs of populations has 
been used in several other studies to identify the origin of 
invasions and migrations (e.g., Szpiech et al. 2008; Bell 
and Matocq 2011). It is also worth considering that the 
similarity in genetic variation of L. Sinettajarvi and 
L. Inari provide additional support for this origin, as 
biological invasions very seldom are associated with an 
increase in genetic variation in the introduced population 
compared to the source population (Nei et al. 1975; 
Dlugosch and Parker 2008b). Thus, taken together, there 
is compelling evidence that the second introduction of 
vendace in 1964-1966 from L. Sinettajarvi represents the 
source of the vendace invasion and establishment in the 
Inari-Pasvik watercourse. 

Founder effects in the initial Inari-Pasvik 
vendace invasion 

Multiple introductions have been shown to retain genetic 
diversity (Kolbe et al. 2004; Facon et al. 2008; Geller 
et al. 2010) and thus increase the likelihood of successful 
colonization and demographic expansion of the invader 
(e.g., Dlugosch and Parker 2008b; Shine 2012). In the 
present study the successful source population for the 
invasion was identified as the one that was introduced 
several times in 1964-1966 to a small lake upstream L. 
Inari. The population that apparently did not succeed in 
establishing was introduced to L. Inari by escapees from 
a local hatchery (Mutenia and Salonen 1992). This 
implies that the propagule pressure for this source likely 
has been smaller and/or that the population was presum- 
ably less adapted for the watercourse compared to the 
repeated introductions from L. Sinettajarvi. In fact, the 
results show that the L. Inari population has similar alle- 
lic richness and heterozygosity to the source population. 
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We did not identify loss of genetic variation via genetic 
bottlenecks for the introduced L. Inari population, which 
have been shown to be the case for other biological inva- 
sions (e.g., Tsutsui et al. 2000; Henry et al. 2009; Ayres 
et al. 2010). Similar effective population size estimates in 
the L. Sinettajarvi and Inari-Pasvik populations also sug- 
gested little genetic drift. Thus, the initial founding event 
of vendace in the Inari-Pasvik watercourse may be asso- 
ciated with no or limited founder effects. However, we 
show that the source and the invasive populations are 
genetically different, which suggest that genetic drift 
actually has played a role in the establishment of the col- 
onists, since time from introduction is too short for 
mutations to have accumulated across the microsatellite 
loci in high enough frequencies to have effect on the 
result. 

Dispersal pattern and genetic structure of 
the secondary expansion 

The results revealed that the populations resulting from 
the secondary expansion downstream from L. Inari have 
diverged genetically from the introduced population in a 
stepping stone pattern. Such a dispersal pattern maintains 
genetic variation and limits divergence compared to other 
expansion patterns (e.g., Ibrahim et al. 1996; Reusch et al. 
2010; Tonione et al. 2011), due to the allelic patchiness 
and likely subsequent gene flow among demes. The popu- 
lations within the secondary expansion have become 
genetically distinguishable from the L. Inari population in 
about 18 years/9 generations (maximum 35 years/ 17 gen- 
erations if the Pasvik populations were founded already 
in 1973 at the time of first observation in L. Inari, but 
this scenario appears highly unlikely from the observed 
occurrences of vendace in the watercourse; see Amundsen 
et al. 1999). This is remarkable, especially because the 
high genetic variation and large consensus and effective 
population size of the colonist population effectively 
should counteract genetic divergence. In marine fish spe- 
cies, such as herring (Clupea harengus), capelin (Mallotus 
villosus), and Atlantic cod (Gadus morhua), it is well 
known that large population sizes with high genetic varia- 
tion and no obvious barriers to gene flow effectively ham- 
per the build up of reproductive isolation (Knutsen et al. 
2003; Mariani et al. 2005; Praebel et al. 2008; but see 
review by DeWoody and Avise 2000). The major differ- 
ence and the most likely explanation for the rapid diver- 
gence observed among the vendace populations within 
the Inari-Pasvik watercourse therefore appears to be the 
presence of several hydroelectric dam constructions 
between the L. Inari population and the two downstream 
populations from the secondary expansion (i.e., L. Skruk- 
kebukta). These dams may effectively hamper downstream 



gene flow and promote the build up of reproductive 
isolation during the expansion. 

Adaptive and evolutionary changes in the 
invader - perspectives 

Identifying the source population and reconstructing the 
routes of biological invasions are crucial for handling and 
managing invasive species, as well as for gaining knowledge 
of the ecology behind successful colonization events. How- 
ever, invasions also offer the possibility to study evolution- 
ary processes such as natural selection and life history 
changes as they unfold. For example, invaders may evolve 
in response to an altered selection regime compared to 
their native range (Facon et al. 2006; Sax et al. 2007; Bacig- 
alupe 2009; Shine 2012). The Inari-Pasvik watercourse is 
300-380 km north of L. Sinettajarvi, but at similar altitude. 
Thus, the summer is shorter and colder, and the ice-cov- 
ered period longer. Especially, temperature is an important 
driver for adaptive changes as it will affect, for example, 
egg incubation time, growth, physiological processes 
(Q10), and general individual fitness (e.g., Mooney and 
Hobbs 2000). In addition, r-strategists are often favored 
during biological invasions (Lodge 1993; Facon et al. 
2006), and our earlier study suggests that these invasive 
populations in the secondary expansion areas have speeded 
up their life history via growth and age at sexual maturity 
(Amundsen et al. 2012). These changes happened within a 
decade and the system therefore apparently represents an 
excellent example of rapid adaptive evolution. Future stud- 
ies of this system may therefore gain valuable insights by 
identifying adaptive phenotypic traits (transcriptome) and 
the genetic basis for local adaptation (e.g., using pheno- 
typic QTLs) (see e.g., Bernatchez et al. 2010). Moreover, 
insights would be gained from investigating variation at 
genes and genomic blocks related to important life history 
parameters such as growth, metabolism, and disease resis- 
tance, to infer adaptive changes or patterns of plasticity 
among the Sinettajarvi-Inari-Pasvik populations. 
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Supporting Information 

Additional Supporting Information may be found in the 
online version of this article: 

Table SI. Summary statistics for each locus and popula- 
tion in the source and invader vendace (Coregonus albula) 
populations from northern Europe. 

Table S2. Geographical distances and expansion times of 
vendace populations used in the partial Mantel test. 
Table S3. Estimates of genetic differentiation (F S t) 



between the five populations of vendace in northern Eur- 
ope. 

Figure SI. Summary of results from STRUCTURE for 
inferring K populations among the tested vendace popu- 
lations when omitting three loci that may be associated 
with null alleles. 

Figure S2. Summary of results from STRUCTURE for 
inferring K populations among the tested vendace popu- 
lations. 
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